Overview

Dataset statistics

Number of variables: 12
Number of observations: 11123
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.0 MiB
Average record size in memory: 96.0 B
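
The memory figures above can be cross-checked with simple arithmetic. The sketch below assumes every one of the 12 columns costs the same number of bytes per row, which the report does not state; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block of this report.
N_OBSERVATIONS = 11123
N_VARIABLES = 12
AVG_RECORD_BYTES = 96.0

# Assumption: the record size is split evenly across the 12 columns.
bytes_per_cell = AVG_RECORD_BYTES / N_VARIABLES
total_bytes = AVG_RECORD_BYTES * N_OBSERVATIONS
total_mib = total_bytes / 1024**2
column_kib = bytes_per_cell * N_OBSERVATIONS / 1024

print(f"bytes per cell: {bytes_per_cell:.1f}")
print(f"total size: {total_mib:.2f} MiB")
print(f"per-column size: {column_kib:.1f} KiB")
```

The total rounds to the reported 1.0 MiB. The per-column estimate comes out slightly below the 87.0 KiB listed for each variable; the gap is presumably per-series overhead (for example the index), which this sketch ignores.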

Variable types

Numeric: 6
Text: 5
Categorical: 1

Alerts

High correlation: ratings_count is highly overall correlated with text_reviews_count
High correlation: text_reviews_count is highly overall correlated with ratings_count
Imbalance: language_code is highly imbalanced (76.6%)
Skewed: isbn13 is highly skewed (γ1 = -21.06647588)
Unique: bookID has unique values
Unique: isbn has unique values
Unique: isbn13 has unique values
Zeros: text_reviews_count has 624 (5.6%) zeros

Reproduction

Analysis started: 2025-07-12 23:23:32.666516
Analysis finished: 2025-07-12 23:23:40.188482
Duration: 7.52 seconds
Software version: ydata-profiling v4.16.1
Download configuration: config.json
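
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2025-07-12 23:23:32.666516")
finished = datetime.fromisoformat("2025-07-12 23:23:40.188482")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```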

Variables

bookID
Real number (ℝ)

Unique 

Distinct: 11123
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21310.857
Minimum: 1
Maximum: 45641
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 87.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1800.1
Q1: 10277.5
median: 20287
Q3: 32104.5
95-th percentile: 43067.5
Maximum: 45641
Range: 45640
Interquartile range (IQR): 21827

Descriptive statistics

Standard deviation: 13094.727
Coefficient of variation (CV): 0.61446273
Kurtosis: -1.1465879
Mean: 21310.857
Median Absolute Deviation (MAD): 10884
Skewness: 0.14401023
Sum: 2.3704066 × 10^8
Variance: 1.7147188 × 10^8
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
Common values

Value                 Count  Frequency (%)
45641                 1      < 0.1%
1                     1      < 0.1%
2                     1      < 0.1%
45585                 1      < 0.1%
45583                 1      < 0.1%
45574                 1      < 0.1%
45572                 1      < 0.1%
45570                 1      < 0.1%
45568                 1      < 0.1%
45564                 1      < 0.1%
Other values (11113)  11113  99.9%

Minimum 10 values

Value  Count  Frequency (%)
1      1      < 0.1%
2      1      < 0.1%
4      1      < 0.1%
5      1      < 0.1%
8      1      < 0.1%
9      1      < 0.1%
10     1      < 0.1%
12     1      < 0.1%
13     1      < 0.1%
14     1      < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
45641  1      < 0.1%
45639  1      < 0.1%
45634  1      < 0.1%
45633  1      < 0.1%
45631  1      < 0.1%
45630  1      < 0.1%
45626  1      < 0.1%
45625  1      < 0.1%
45623  1      < 0.1%
45617  1      < 0.1%
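
Several of the bookID statistics are derived from others (range, IQR, CV, and the standard deviation from the variance). The sketch below recomputes them from the reported figures; it only checks internal consistency and does not touch the underlying data.

```python
import math

# Values copied from the bookID statistics in this report.
minimum, maximum = 1, 45641
q1, q3 = 10277.5, 32104.5
mean, std = 21310.857, 13094.727
variance = 1.7147188e8

value_range = maximum - minimum      # reported Range: 45640
iqr = q3 - q1                        # reported IQR: 21827
cv = std / mean                      # reported CV: 0.61446273
std_from_var = math.sqrt(variance)   # reported standard deviation: 13094.727

print(value_range, iqr, round(cv, 6), round(std_from_var, 2))
```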

title
Text

Distinct: 10348
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Memory size: 87.0 KiB

Length

Max length: 254
Median length: 139
Mean length: 35.844287
Min length: 2

Characters and Unicode

Total characters: 398696
Distinct characters: 167
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9861
Unique (%): 88.7%

Sample

1st row: Harry Potter and the Half-Blood Prince (Harry Potter #6)
2nd row: Harry Potter and the Order of the Phoenix (Harry Potter #5)
3rd row: Harry Potter and the Chamber of Secrets (Harry Potter #2)
4th row: Harry Potter and the Prisoner of Azkaban (Harry Potter #3)
5th row: Harry Potter Boxed Set Books 1-5 (Harry Potter #1-5)

Words

Value                 Count  Frequency (%)
the                   6688   10.1%
of                    3334   5.0%
and                   1651   2.5%
a                     1335   2.0%
1                     796    1.2%
in                    777    1.2%
to                    697    1.1%
(blank)               588    0.9%
2                     519    0.8%
3                     399    0.6%
Other values (12079)  49507  74.7%

Most occurring characters

Value               Count   Frequency (%)
(space)             58852   14.8%
e                   36612   9.2%
o                   23573   5.9%
a                   22292   5.6%
i                   20601   5.2%
r                   20200   5.1%
n                   20016   5.0%
t                   19138   4.8%
s                   16684   4.2%
h                   13689   3.4%
Other values (157)  147039  36.9%

Most occurring categories

Value      Count   Frequency (%)
(unknown)  398696  100.0%

Most occurring scripts

Value      Count   Frequency (%)
(unknown)  398696  100.0%

Most occurring blocks

Value      Count   Frequency (%)
(unknown)  398696  100.0%

authors
Text

Distinct: 6639
Distinct (%): 59.7%
Missing: 0
Missing (%): 0.0%
Memory size: 87.0 KiB

Length

Max length: 751
Median length: 375
Mean length: 24.823968
Min length: 3

Characters and Unicode

Total characters: 276117
Distinct characters: 148
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5278
Unique (%): 47.5%

Sample

1st row: J.K. Rowling/Mary GrandPré
2nd row: J.K. Rowling/Mary GrandPré
3rd row: J.K. Rowling
4th row: J.K. Rowling/Mary GrandPré
5th row: J.K. Rowling/Mary GrandPré

Words

Value                 Count  Frequency (%)
john                  279    0.8%
william               262    0.8%
james                 227    0.7%
david                 201    0.6%
a                     191    0.6%
robert                185    0.5%
j                     181    0.5%
stephen               176    0.5%
richard               157    0.5%
m                     155    0.5%
Other values (12654)  31794  94.0%

Most occurring characters

Value               Count   Frequency (%)
e                   23738   8.6%
(space)             23397   8.5%
a                   22538   8.2%
r                   18107   6.6%
n                   17411   6.3%
i                   15667   5.7%
o                   14404   5.2%
l                   13268   4.8%
s                   9813    3.6%
t                   9521    3.4%
Other values (138)  108253  39.2%

Most occurring categories

Value      Count   Frequency (%)
(unknown)  276117  100.0%

Most occurring scripts

Value      Count   Frequency (%)
(unknown)  276117  100.0%

Most occurring blocks

Value      Count   Frequency (%)
(unknown)  276117  100.0%

average_rating
Real number (ℝ)

Distinct: 209
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.9340753
Minimum: 0
Maximum: 5
Zeros: 25
Zeros (%): 0.2%
Negative: 0
Negative (%): 0.0%
Memory size: 87.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 3.44
Q1: 3.77
median: 3.96
Q3: 4.14
95-th percentile: 4.38
Maximum: 5
Range: 5
Interquartile range (IQR): 0.37

Descriptive statistics

Standard deviation: 0.35048531
Coefficient of variation (CV): 0.089089629
Kurtosis: 36.222806
Mean: 3.9340753
Median Absolute Deviation (MAD): 0.18
Skewness: -3.5774415
Sum: 43758.72
Variance: 0.12283995
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count  Frequency (%)
4                   219    2.0%
3.96                195    1.8%
4.02                178    1.6%
3.94                176    1.6%
4.07                172    1.5%
3.93                168    1.5%
4.05                168    1.5%
3.92                168    1.5%
3.83                166    1.5%
3.89                166    1.5%
Other values (199)  9347   84.0%

Minimum 10 values

Value  Count  Frequency (%)
0      25     0.2%
1      2      < 0.1%
1.67   1      < 0.1%
2      6      0.1%
2.33   1      < 0.1%
2.4    1      < 0.1%
2.5    1      < 0.1%
2.55   1      < 0.1%
2.61   1      < 0.1%
2.62   3      < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
5      22     0.2%
4.91   1      < 0.1%
4.88   1      < 0.1%
4.86   1      < 0.1%
4.83   1      < 0.1%
4.82   1      < 0.1%
4.8    1      < 0.1%
4.78   2      < 0.1%
4.76   1      < 0.1%
4.75   2      < 0.1%

isbn
Text

Unique 

Distinct: 11123
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 87.0 KiB

Length

Max length: 10
Median length: 10
Mean length: 9.9999101
Min length: 9

Characters and Unicode

Total characters: 111229
Distinct characters: 12
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11123
Unique (%): 100.0%

Sample

1st row: 0439785960
2nd row: 0439358078
3rd row: 0439554896
4th row: 043965548X
5th row: 0439682584

Words

Value                 Count  Frequency (%)
043965548x            1      < 0.1%
8497646983            1      < 0.1%
0439785960            1      < 0.1%
8466302298            1      < 0.1%
8432203238            1      < 0.1%
9583006408            1      < 0.1%
0061199001            1      < 0.1%
972233168x            1      < 0.1%
9722332201            1      < 0.1%
9722330551            1      < 0.1%
Other values (11113)  11113  99.9%

Most occurring characters

Value             Count  Frequency (%)
0                 19990  18.0%
1                 12568  11.3%
4                 11493  10.3%
5                 10540  9.5%
3                 10379  9.3%
2                 9463   8.5%
7                 9344   8.4%
8                 9101   8.2%
6                 9076   8.2%
9                 8291   7.5%
Other values (2)  984    0.9%

Most occurring categories

Value      Count   Frequency (%)
(unknown)  111229  100.0%

Most occurring scripts

Value      Count   Frequency (%)
(unknown)  111229  100.0%

Most occurring blocks

Value      Count   Frequency (%)
(unknown)  111229  100.0%

isbn13
Real number (ℝ)

Skewed  Unique 

Distinct: 11123
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.7598802 × 10^12
Minimum: 8.9870598 × 10^9
Maximum: 9.7900077 × 10^12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 87.0 KiB

Quantile statistics

Minimum: 8.9870598 × 10^9
5-th percentile: 9.7800609 × 10^12
Q1: 9.7803455 × 10^12
median: 9.7805825 × 10^12
Q3: 9.7808722 × 10^12
95-th percentile: 9.7819322 × 10^12
Maximum: 9.7900077 × 10^12
Range: 9.7810206 × 10^12
Interquartile range (IQR): 5.2675424 × 10^8

Descriptive statistics

Standard deviation: 4.4297585 × 10^11
Coefficient of variation (CV): 0.045387426
Kurtosis: 442.47375
Mean: 9.7598802 × 10^12
Median Absolute Deviation (MAD): 2.5197677 × 10^8
Skewness: -21.066476
Sum: 1.0855915 × 10^17
Variance: 1.962276 × 10^23
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count  Frequency (%)
9.788497647 × 10^12   1      < 0.1%
9.780439786 × 10^12   1      < 0.1%
9.780439358 × 10^12   1      < 0.1%
9.788432217 × 10^12   1      < 0.1%
9.788466319 × 10^12   1      < 0.1%
9.781932395 × 10^12   1      < 0.1%
9.781855495 × 10^12   1      < 0.1%
9.780573051 × 10^12   1      < 0.1%
9.789681907 × 10^12   1      < 0.1%
9.781568521 × 10^12   1      < 0.1%
Other values (11113)  11113  99.9%

Minimum 10 values

Value                Count  Frequency (%)
8987059752           1      < 0.1%
2.004913 × 10^10     1      < 0.1%
2.375500432 × 10^10  1      < 0.1%
3.44060546 × 10^10   1      < 0.1%
4.908600776 × 10^10  1      < 0.1%
7.399914077 × 10^10  1      < 0.1%
7.399925491 × 10^10  1      < 0.1%
7.399976844 × 10^10  1      < 0.1%
7.399996082 × 10^10  1      < 0.1%
7.609202599 × 10^10  1      < 0.1%

Maximum 10 values

Value                Count  Frequency (%)
9.790007672 × 10^12  1      < 0.1%
9.789998692 × 10^12  1      < 0.1%
9.789879398 × 10^12  1      < 0.1%
9.789875801 × 10^12  1      < 0.1%
9.789875662 × 10^12  1      < 0.1%
9.78987225 × 10^12   1      < 0.1%
9.789861157 × 10^12  1      < 0.1%
9.789861157 × 10^12  1      < 0.1%
9.789861146 × 10^12  1      < 0.1%
9.789861146 × 10^12  1      < 0.1%
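
The smallest isbn13 values listed above have only 10 or 11 digits, while the bulk of the column sits near 9.78 × 10^12. That handful of short values is a plausible driver of the skewness alert, although the report itself does not say so. The sketch below shows one way such rows could be flagged, and how an isbn13 can be rebuilt from the 10-character isbn using the standard 978 prefix and ISBN-13 check digit. The function names are illustrative, not part of any library.

```python
def isbn10_to_isbn13(isbn10: str) -> str:
    """Convert a 10-character ISBN to its 13-digit form with the 978 prefix."""
    core = "978" + isbn10[:9]  # drop the old check character (may be 'X')
    # ISBN-13 weights alternate 1, 3, 1, 3, ... over the first 12 digits.
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(core))
    check = (10 - total % 10) % 10
    return core + str(check)


def looks_like_isbn13(value: int) -> bool:
    """True when the integer has exactly 13 decimal digits."""
    return len(str(value)) == 13


# isbn values taken from the first and fourth sample rows of this report.
print(isbn10_to_isbn13("0439785960"))
print(isbn10_to_isbn13("043965548X"))
# Smallest isbn13 value listed in the report.
print(looks_like_isbn13(8987059752))
```

Treating isbn13 as text rather than as a real number would also avoid the scientific-notation rounding seen in the tables above.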

language_code
Categorical

Imbalance 

Distinct: 27
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 87.0 KiB

Value              Count
eng                8908
en-US              1408
spa                218
en-GB              214
fre                144
Other values (22)  231

Length

Max length: 5
Median length: 3
Mean length: 3.2928167
Min length: 2

Characters and Unicode

Total characters: 36626
Distinct characters: 26
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10
Unique (%): 0.1%

Sample

1st row: eng
2nd row: eng
3rd row: eng
4th row: eng
5th row: eng

Common Values

Value              Count  Frequency (%)
eng                8908   80.1%
en-US              1408   12.7%
spa                218    2.0%
en-GB              214    1.9%
fre                144    1.3%
ger                99     0.9%
jpn                46     0.4%
mul                19     0.2%
zho                14     0.1%
grc                11     0.1%
Other values (17)  42     0.4%

Length

Histogram of lengths of the category

Words

Value              Count  Frequency (%)
eng                8908   80.1%
en-us              1408   12.7%
spa                218    2.0%
en-gb              214    1.9%
fre                144    1.3%
ger                99     0.9%
jpn                46     0.4%
mul                19     0.2%
zho                14     0.1%
grc                11     0.1%
Other values (17)  42     0.4%

Most occurring characters

Value              Count  Frequency (%)
e                  10787  29.5%
n                  10588  28.9%
g                  9021   24.6%
-                  1629   4.4%
U                  1408   3.8%
S                  1408   3.8%
p                  275    0.8%
r                  270    0.7%
a                  231    0.6%
s                  224    0.6%
Other values (16)  785    2.1%

Most occurring categories

Value      Count  Frequency (%)
(unknown)  36626  100.0%

Most occurring scripts

Value      Count  Frequency (%)
(unknown)  36626  100.0%

Most occurring blocks

Value      Count  Frequency (%)
(unknown)  36626  100.0%
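
The 76.6% imbalance figure for language_code can be approximated from the counts above. The sketch assumes the score is one minus the normalized Shannon entropy of the class counts; the report does not state its formula, so treat this as an assumption. The report also does not list the 17 rarest codes individually, so their counts below are a hypothetical split chosen only to match the stated totals (42 rows, 10 codes occurring exactly once).

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 for uniform classes, near 1 for one dominant class."""
    total = sum(counts)
    n = len(counts)
    if n < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n)


# Counts of the ten most common codes, copied from the Common Values table.
top10 = [8908, 1408, 218, 214, 144, 99, 46, 19, 14, 11]
# HYPOTHETICAL split of the remaining 42 rows over 17 codes: ten singletons
# (as reported under "Unique") plus seven invented counts that sum to 32.
rest = [1] * 10 + [5, 5, 5, 5, 4, 4, 4]

score = imbalance_score(top10 + rest)
print(f"{score:.1%}")
```

Under these assumptions the result lands close to the reported value. The exact split of the rare codes has little effect because they carry a tiny share of the rows.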

num_pages
Real number (ℝ)

Distinct: 997
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 336.40556
Minimum: 0
Maximum: 6576
Zeros: 76
Zeros (%): 0.7%
Negative: 0
Negative (%): 0.0%
Memory size: 87.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 48
Q1: 192
median: 299
Q3: 416
95-th percentile: 752
Maximum: 6576
Range: 6576
Interquartile range (IQR): 224

Descriptive statistics

Standard deviation: 241.15263
Coefficient of variation (CV): 0.7168509
Kurtosis: 62.415973
Mean: 336.40556
Median Absolute Deviation (MAD): 107
Skewness: 4.2717781
Sum: 3741839
Variance: 58154.589
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count  Frequency (%)
288                 230    2.1%
192                 221    2.0%
320                 218    2.0%
256                 207    1.9%
352                 202    1.8%
224                 198    1.8%
208                 178    1.6%
304                 177    1.6%
240                 173    1.6%
384                 172    1.5%
Other values (987)  9147   82.2%

Minimum 10 values

Value  Count  Frequency (%)
0      76     0.7%
1      11     0.1%
2      15     0.1%
3      19     0.2%
4      11     0.1%
5      16     0.1%
6      20     0.2%
7      6      0.1%
8      10     0.1%
9      11     0.1%

Maximum 10 values

Value  Count  Frequency (%)
6576   1      < 0.1%
4736   1      < 0.1%
3400   1      < 0.1%
3342   1      < 0.1%
3020   1      < 0.1%
2751   1      < 0.1%
2690   1      < 0.1%
2480   1      < 0.1%
2264   1      < 0.1%
2198   1      < 0.1%

ratings_count
Real number (ℝ)

High correlation 

Distinct: 5294
Distinct (%): 47.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17942.848
Minimum: 0
Maximum: 4597666
Zeros: 80
Zeros (%): 0.7%
Negative: 0
Negative (%): 0.0%
Memory size: 87.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 8
Q1: 104
median: 745
Q3: 5000.5
95-th percentile: 61114
Maximum: 4597666
Range: 4597666
Interquartile range (IQR): 4896.5

Descriptive statistics

Standard deviation: 112499.15
Coefficient of variation (CV): 6.2698605
Kurtosis: 442.27167
Mean: 17942.848
Median Absolute Deviation (MAD): 728
Skewness: 17.693952
Sum: 1.995783 × 10^8
Variance: 1.265606 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count  Frequency (%)
3                    82     0.7%
0                    80     0.7%
1                    76     0.7%
2                    71     0.6%
4                    71     0.6%
5                    61     0.5%
9                    60     0.5%
8                    59     0.5%
6                    57     0.5%
7                    56     0.5%
Other values (5284)  10450  93.9%

Minimum 10 values

Value  Count  Frequency (%)
0      80     0.7%
1      76     0.7%
2      71     0.6%
3      82     0.7%
4      71     0.6%
5      61     0.5%
6      57     0.5%
7      56     0.5%
8      59     0.5%
9      60     0.5%

Maximum 10 values

Value    Count  Frequency (%)
4597666  1      < 0.1%
2530894  1      < 0.1%
2457092  1      < 0.1%
2418736  1      < 0.1%
2339585  1      < 0.1%
2293963  1      < 0.1%
2153167  1      < 0.1%
2128944  1      < 0.1%
2111750  1      < 0.1%
2095690  1      < 0.1%

text_reviews_count
Real number (ℝ)

High correlation  Zeros 

Distinct: 1822
Distinct (%): 16.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 542.0481
Minimum: 0
Maximum: 94265
Zeros: 624
Zeros (%): 5.6%
Negative: 0
Negative (%): 0.0%
Memory size: 87.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 9
median: 47
Q3: 238
95-th percentile: 2158.9
Maximum: 94265
Range: 94265
Interquartile range (IQR): 229

Descriptive statistics

Standard deviation: 2576.6196
Coefficient of variation (CV): 4.7534888
Kurtosis: 396.56506
Mean: 542.0481
Median Absolute Deviation (MAD): 45
Skewness: 16.175096
Sum: 6029201
Variance: 6638968.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count  Frequency (%)
0                    624    5.6%
1                    458    4.1%
2                    354    3.2%
3                    263    2.4%
4                    247    2.2%
5                    223    2.0%
6                    199    1.8%
7                    180    1.6%
9                    164    1.5%
8                    162    1.5%
Other values (1812)  8249   74.2%

Minimum 10 values

Value  Count  Frequency (%)
0      624    5.6%
1      458    4.1%
2      354    3.2%
3      263    2.4%
4      247    2.2%
5      223    2.0%
6      199    1.8%
7      180    1.6%
8      162    1.5%
9      164    1.5%

Maximum 10 values

Value  Count  Frequency (%)
94265  1      < 0.1%
86881  1      < 0.1%
56604  1      < 0.1%
55843  1      < 0.1%
52759  1      < 0.1%
47951  1      < 0.1%
47620  1      < 0.1%
46176  1      < 0.1%
43499  1      < 0.1%
36325  1      < 0.1%

publication_date
Text

Distinct: 3679
Distinct (%): 33.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.0 KiB

Length

Max length: 10
Median length: 9
Mean length: 8.7239953
Min length: 8

Characters and Unicode

Total characters: 97037
Distinct characters: 11
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2022
Unique (%): 18.2%

Sample

1st row: 9/16/2006
2nd row: 9/1/2004
3rd row: 11/1/2003
4th row: 5/1/2004
5th row: 9/13/2004

Words

Value                Count  Frequency (%)
10/1/2005            56     0.5%
11/1/2005            53     0.5%
9/1/2006             51     0.5%
10/1/2006            48     0.4%
11/1/2006            40     0.4%
8/1/2006             39     0.4%
7/1/2004             39     0.4%
8/1/2005             37     0.3%
7/1/2003             37     0.3%
10/1/2004            37     0.3%
Other values (3669)  10686  96.1%

Most occurring characters

Value  Count  Frequency (%)
/      22246  22.9%
0      17995  18.5%
1      15687  16.2%
2      13215  13.6%
9      8399   8.7%
6      3779   3.9%
5      3681   3.8%
3      3359   3.5%
4      3080   3.2%
7      2824   2.9%

Most occurring categories

Value      Count  Frequency (%)
(unknown)  97037  100.0%

Most occurring scripts

Value      Count  Frequency (%)
(unknown)  97037  100.0%

Most occurring blocks

Value      Count  Frequency (%)
(unknown)  97037  100.0%

publisher
Text

Distinct: 2290
Distinct (%): 20.6%
Missing: 0
Missing (%): 0.0%
Memory size: 87.0 KiB

Length

Max length: 67
Median length: 53
Mean length: 15.226647
Min length: 2

Characters and Unicode

Total characters: 169366
Distinct characters: 139
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1295
Unique (%): 11.6%

Sample

1st row: Scholastic Inc.
2nd row: Scholastic Inc.
3rd row: Scholastic
4th row: Scholastic Inc.
5th row: Scholastic

Words

Value                Count  Frequency (%)
books                2302   9.3%
press                1314   5.3%
penguin              598    2.4%
university           551    2.2%
publishing           511    2.1%
vintage              409    1.6%
(blank)              352    1.4%
classics             344    1.4%
company              331    1.3%
house                319    1.3%
Other values (2015)  17764  71.6%

Most occurring characters

Value               Count  Frequency (%)
(space)             14210  8.4%
o                   12961  7.7%
e                   12943  7.6%
s                   11905  7.0%
r                   11776  7.0%
i                   10684  6.3%
a                   10474  6.2%
n                   10454  6.2%
l                   6207   3.7%
t                   5814   3.4%
Other values (129)  61938  36.6%

Most occurring categories

Value      Count   Frequency (%)
(unknown)  169366  100.0%

Most occurring scripts

Value      Count   Frequency (%)
(unknown)  169366  100.0%

Most occurring blocks

Value      Count   Frequency (%)
(unknown)  169366  100.0%

Interactions

[Figures: 36 pairwise interaction plots]

Correlations

                    num_pages  average_rating  bookID  isbn13  language_code  ratings_count  text_reviews_count
num_pages           1.000      0.110           -0.010  -0.138  0.000          0.185          0.168
average_rating      0.110      1.000           -0.037  0.054   0.102          0.086          0.032
bookID              -0.010     -0.037          1.000   0.041   0.050          -0.099         -0.112
isbn13              -0.138     0.054           0.041   1.000   0.000          -0.252         -0.264
language_code       0.000      0.102           0.050   0.000   1.000          0.000          0.000
ratings_count       0.185      0.086           -0.099  -0.252  0.000          1.000          0.959
text_reviews_count  0.168      0.032           -0.112  -0.264  0.000          0.959          1.000
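
The correlation matrix can be scanned for strongly related pairs, which is presumably how the High correlation alerts arise. The sketch below uses a cut-off of 0.5; that threshold is an assumption for illustration and is not stated anywhere in this report.

```python
# Correlation matrix copied from the Correlations section of this report.
COLUMNS = ["num_pages", "average_rating", "bookID", "isbn13",
           "language_code", "ratings_count", "text_reviews_count"]
MATRIX = [
    [1.000, 0.110, -0.010, -0.138, 0.000, 0.185, 0.168],
    [0.110, 1.000, -0.037, 0.054, 0.102, 0.086, 0.032],
    [-0.010, -0.037, 1.000, 0.041, 0.050, -0.099, -0.112],
    [-0.138, 0.054, 0.041, 1.000, 0.000, -0.252, -0.264],
    [0.000, 0.102, 0.050, 0.000, 1.000, 0.000, 0.000],
    [0.185, 0.086, -0.099, -0.252, 0.000, 1.000, 0.959],
    [0.168, 0.032, -0.112, -0.264, 0.000, 0.959, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def high_pairs(columns, matrix, threshold):
    """Return (name_a, name_b, value) for each pair above the threshold."""
    pairs = []
    for i in range(len(columns)):
        for j in range(i + 1, len(columns)):  # upper triangle only
            if abs(matrix[i][j]) >= threshold:
                pairs.append((columns[i], columns[j], matrix[i][j]))
    return pairs


pairs = high_pairs(COLUMNS, MATRIX, THRESHOLD)
print(pairs)
```

Only the ratings_count / text_reviews_count pair (0.959) clears this threshold, which matches the two High correlation alerts in the overview.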

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

index | bookID | title | authors | average_rating | isbn | isbn13 | language_code | num_pages | ratings_count | text_reviews_count | publication_date | publisher
0 | 1 | Harry Potter and the Half-Blood Prince (Harry Potter #6) | J.K. Rowling/Mary GrandPré | 4.57 | 0439785960 | 9780439785969 | eng | 652 | 2095690 | 27591 | 9/16/2006 | Scholastic Inc.
1 | 2 | Harry Potter and the Order of the Phoenix (Harry Potter #5) | J.K. Rowling/Mary GrandPré | 4.49 | 0439358078 | 9780439358071 | eng | 870 | 2153167 | 29221 | 9/1/2004 | Scholastic Inc.
2 | 4 | Harry Potter and the Chamber of Secrets (Harry Potter #2) | J.K. Rowling | 4.42 | 0439554896 | 9780439554893 | eng | 352 | 6333 | 244 | 11/1/2003 | Scholastic
3 | 5 | Harry Potter and the Prisoner of Azkaban (Harry Potter #3) | J.K. Rowling/Mary GrandPré | 4.56 | 043965548X | 9780439655484 | eng | 435 | 2339585 | 36325 | 5/1/2004 | Scholastic Inc.
4 | 8 | Harry Potter Boxed Set Books 1-5 (Harry Potter #1-5) | J.K. Rowling/Mary GrandPré | 4.78 | 0439682584 | 9780439682589 | eng | 2690 | 41428 | 164 | 9/13/2004 | Scholastic
5 | 9 | Unauthorized Harry Potter Book Seven News: "Half-Blood Prince" Analysis and Speculation | W. Frederick Zimmerman | 3.74 | 0976540606 | 9780976540601 | en-US | 152 | 19 | 1 | 4/26/2005 | Nimble Books
6 | 10 | Harry Potter Collection (Harry Potter #1-6) | J.K. Rowling | 4.73 | 0439827604 | 9780439827607 | eng | 3342 | 28242 | 808 | 9/12/2005 | Scholastic
7 | 12 | The Ultimate Hitchhiker's Guide: Five Complete Novels and One Story (Hitchhiker's Guide to the Galaxy #1-5) | Douglas Adams | 4.38 | 0517226952 | 9780517226957 | eng | 815 | 3628 | 254 | 11/1/2005 | Gramercy Books
8 | 13 | The Ultimate Hitchhiker's Guide to the Galaxy (Hitchhiker's Guide to the Galaxy #1-5) | Douglas Adams | 4.38 | 0345453743 | 9780345453747 | eng | 815 | 249558 | 4080 | 4/30/2002 | Del Rey Books
9 | 14 | The Hitchhiker's Guide to the Galaxy (Hitchhiker's Guide to the Galaxy #1) | Douglas Adams | 4.22 | 1400052920 | 9781400052929 | eng | 215 | 4930 | 460 | 8/3/2004 | Crown

Last rows

index | bookID | title | authors | average_rating | isbn | isbn13 | language_code | num_pages | ratings_count | text_reviews_count | publication_date | publisher
11113 | 45617 | O Cavalo e o Seu Rapaz (As Crónicas de Nárnia #3) | C.S. Lewis/Pauline Baynes/Ana Falcão Bastos | 3.92 | 9722330551 | 9789722330558 | por | 160 | 207 | 16 | 8/15/2003 | Editorial Presença
11114 | 45623 | O Sobrinho do Mágico (As Crónicas de Nárnia #1) | C.S. Lewis/Pauline Baynes/Ana Falcão Bastos | 4.04 | 9722329987 | 9789722329989 | por | 147 | 396 | 37 | 4/8/2003 | Editorial Presença
11115 | 45625 | A Viagem do Caminheiro da Alvorada (As Crónicas de Nárnia #5) | C.S. Lewis/Pauline Baynes/Ana Falcão Bastos | 4.09 | 9722331329 | 9789722331326 | por | 176 | 161 | 14 | 9/1/2004 | Editorial Presença
11116 | 45626 | O Príncipe Caspian (As Crónicas de Nárnia #4) | C.S. Lewis/Pauline Baynes/Ana Falcão Bastos | 3.97 | 9722330977 | 9789722330978 | por | 160 | 215 | 11 | 10/11/2003 | Editorial Presença
11117 | 45630 | Whores for Gloria | William T. Vollmann | 3.69 | 0140231579 | 9780140231571 | en-US | 160 | 932 | 111 | 2/1/1994 | Penguin Books
11118 | 45631 | Expelled from Eden: A William T. Vollmann Reader | William T. Vollmann/Larry McCaffery/Michael Hemmingson | 4.06 | 1560254416 | 9781560254416 | eng | 512 | 156 | 20 | 12/21/2004 | Da Capo Press
11119 | 45633 | You Bright and Risen Angels | William T. Vollmann | 4.08 | 0140110879 | 9780140110876 | eng | 635 | 783 | 56 | 12/1/1988 | Penguin Books
11120 | 45634 | The Ice-Shirt (Seven Dreams #1) | William T. Vollmann | 3.96 | 0140131965 | 9780140131963 | eng | 415 | 820 | 95 | 8/1/1993 | Penguin Books
11121 | 45639 | Poor People | William T. Vollmann | 3.72 | 0060878827 | 9780060878825 | eng | 434 | 769 | 139 | 2/27/2007 | Ecco
11122 | 45641 | Las aventuras de Tom Sawyer | Mark Twain | 3.91 | 8497646983 | 9788497646987 | spa | 272 | 113 | 12 | 5/28/2006 | Edimat Libros